Consistent Wiener Filtering: Generalized Time-Frequency Masking Respecting Spectrogram Consistency
نویسندگان
چکیده
Wiener filtering is one of the most widely used methods in audio source separation. It is often applied on time-frequency representations of signals, such as the short-time Fourier transform (STFT), to exploit their short-term stationarity, but so far the design of the Wiener time-frequency mask did not take into account the necessity for the output spectrograms to be consistent, i.e., to correspond to the STFT of a time-domain signal. In this paper, we generalize the concept of Wiener filtering to time-frequency masks which can involve manipulation of the phase as well by formulating the problem as a consistency-constrained Maximum-Likelihood one. We present two methods to solve the problem, one looking for the optimal time-domain signal, the other promoting consistency through a penalty function directly in the time-frequency domain. We show through experimental evaluation that, both in oracle conditions and combined with spectral subtraction, our method outperforms classical Wiener filtering.
منابع مشابه
Radio Frequency Interference Detection and Mitigation Algorithms Based on Spectrogram Analysis
Radio Frequency Interference (RFI) detection and mitigation algorithms based on a signal’s spectrogram (frequency and time domain representation) are presented. The radiometric signal’s spectrogram is treated as an image, and therefore image processing techniques are applied to detect and mitigate RFI by two-dimensional filtering. A series of Monte-Carlo simulations have been performed to evalu...
متن کامل2D Spectrogram Filter for Single Channel Speech Enhancement
In this paper, we propose a novel approach for single channel speech enhancement by exploiting the correlation among 2D transform coefficients, which has been previously neglected by traditional speech enhancement methods. Our approach makes use of a time-frequency representation (spectrogram) of the input signal and a novel 2D spectrogram filter (2DSF)is designed to provide a good estimate of ...
متن کاملPhase-based informed source separation for active listening of music
This paper presents an informed source separation technique of monophonic mixtures. Although the vast majority of the separation methods are based on the time-frequency energy of each source, we introduce a new approach using solely phase information to perform the separation. The sources are iteratively reconstructed using an adaptation of the Multiple Input Spectrogram Inversion (MISI) algori...
متن کاملSemi-supervised NMF with Time-frequency Annotations for Single-channel Source Separation
We formulate a novel extension of nonnegative matrix factorization (NMF) to take into account partial information on source-specific activity in the spectrogram. This information comes in the form of masking coefficients, such as those found in an ideal binary mask. We show that state-ofthe-art results in source separation may be achieved with only a limited amount of correct annotation, and fu...
متن کاملNon-negative tensor factorisation of modulation spectrograms for monaural sound source separation
This paper proposes an algorithm for separating monaural audio signals by non-negative tensor factorisation of modulation spectrograms. The modulation spectrogram is able to represent redundant patterns across frequency with similar features, and the tensor factorisation is able to isolate these patterns in an unsupervised way. The method overcomes the limitation of conventional non-negative ma...
متن کامل